Inflationary perturbation theory is geometrical optics in phase space 
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A pressing problem in comparing inflationary models with observation is the accurate calculation of 
correlation functions. One approach is to evolve them using ordinary differential equations ("transport 
equations"), analogous to the Schwinger-Dyson hierarchy oiin-out quantum field theory. We extend this 
approach to the complete set of momentum space correlation functions. A formal solution can be obtained 
using raytracing techniques adapted from geometrical optics. We reformulate inflationary perturbation the- 
ory in this language, and show that raytracing reproduces the familiar "5JV" Taylor expansion. Our method 
produces ordinary differential equations which allow the Taylor coefficients to be computed efficiently. We 
use raytracing methods to express the gauge transformation between field fluctuations and the curvature 
perturbation, f, in geometrical terms. Using these results we give a compact expression for the nonlinear 
gauge-transform part of / NL in terms of the principal curvatures of uniform energy-density hypersurfaces 
in field space. 



I. INTRODUCTION 

Our current theories of the early universe are stochas- 
tic. They do not predict a definite state today: rather, their 
predictions are statistical. To compare these predictions 
with observation it must usually be supposed that we are in 
some sense typical. This brings two challenges. First, what 
is typical under some circumstances may be atypical under 
others. Therefore we must be precise about the type of ob- 
server of which we are a typical representative. This leads 
to the "measure problem," about which we have nothing 
new to say In this paper we are concerned with the second 
challenge: after fixing a class of observers, to estimate the 
observables typically measured by its members. 

Inflation is the most common early-universe paradigm 
for which we would like to compute observables. In this 
context we usually take ourselves to be ordinary observers 
of the fluctuations produced on approach to a fixed vac- 
uum. The challenge is to calculate the typical stochastic 
properties of these fluctuations. 

The most important fluctuation generated by inflation 
is the primordial density perturbation, f . Correlations in 
the temperature and polarization anisotropies of the mi- 
crowave background are inherited from £ and provide a 
clean probe of its statistical character. Therefore, both 
present-day constraints uJ] and the imminent arrival of 
high quality microwave-background data []2[] make accu- 
rate estimates of its statistical properties a pressing issue. 
Meanwhile, large surveys of the cosmological density field 
will provide information about its properties on comple- 
mentary, smaller scales [@] . To compare this abundance of 
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data to models we require an efficient tool with which to 
estimate the n-point functions (£"). 

Taking ( to be synthesized from the fluctuation of one 
or more light scalar fields during an inflationary era, sev- 
eral computational schemes exist which enable the n-point 
functions to be studied. Many of these schemes employ 
some variant of the separate universe picture Jfj]. Taking 
H to be the Hubble parameter, this asserts that — when 
smoothed on some physical scale, L, much larger than the 
horizon scale, so that L/H -1 ;> 1 — the average evolution 
of each L -sized patch can be computed using the back- 
ground equations of motion and initial conditions taken 
from smoothed quantities local to the patch. Working from 
a Taylor expansion in the initial conditions for each patch, 
Lyth & Rodriguez showed how this assumption could be 
turned into a practical algorithm for calculating correlation 
functions Q . This "5N method" has become the most pop- 
ular way to explore the predictions of specific models, both 
analytically and numerically, and has developed a large lit- 
erature of its own. The principal difficulty arises when cal- 
culating the coefficients of the Taylor expansion, sometimes 
called the "5N coefficients." We shall discuss this difficulty 
in flVBl 

Alternative approaches exist. Rigopoulos, Shellard & van 
Tent $Q\ evolved each correlation function using a Langevin 
equation. Yokoyama, Suyama & Tanaka 0] decomposed 
each 5N coefficient into components which could be com- 
puted using ordinary differential equations. Later, a sys- 
tematic method to obtain 'transport' equations for the en- 
tire hierarchy of correlation functions (rather than simply 
the 5N coefficients) was introduced [8]. A more longstand- 
ing approach uses the methods of traditional cosmological 
perturbation theory ("CPT") to produce "transfer matrices" 
113] • This has recently been revived by a number of au- 
thors [fltMHT l . Numerical approaches have been employed 
by Lehners & Renaux-Petel [fia l , Ringeval [16;] , and Hus- 
ton & Malik ifiT 



The relationship of these different methods to each other 
has not always been clear. Nor is it always obvious how to 
relate the approximations employed by each technique. In 
this paper we study the connections between many of these 
approaches using the formalism of Elliston et al. lfl8ll . This 
is a statistical interpretation of the separate universe pic- 
ture. In what follows we briefly summarize the construc- 
tion. (See also Ref. flU].) 

The separate universe approximation as statistical 
mechanics. — Fix a large spacetime box of comoving side 
;u containing the region of interest. The scale n should 
be much larger than the separate universe scale, requir- 
ing jj, » L, but not superexponentially larger BlSll . After 
smoothing on the scale L, the fields within the large box 
pick out an ensemble or cloud of N ~ (mA) 3 points in 
the classical phase space. The condition that n/L is not 
superexponentially large means that the typical diameter 
of the cloud will be roughly of order the quantum scatter 
{5cp 2 ) 112 ~ H. Because N is still large, N » 1, it is con- 
venient to describe the ensemble by an occupation prob- 
ability p on phase space Q The correlation functions of f 
on the scale L are then determined by the classical statis- 
tical mechanics of this ensemble, which is encoded in the 
Boltzmann equation. 

In familiar applications of statistical mechanics, the evo- 
lution of the ensemble may be complicated. Small-scale 
interactions scatter members of the cloud between orbits 
on phase space, represented by the collisional term in the 
Boltzmann equation. However, the separate universe as- 
sumption requires causality to suppress those interactions 
which would be required for scattering between orbits. 
Therefore the evolution is trivial. Each point in phase space 
is assigned an occupation probability by the initial condi- 
tions, which is conserved along its orbit. All that is required 
is a mapping of initial conditions to the final state, which is 
obtained by carrying the initial conditions along the phase 
space flow generated by the underlying theory. It follows 
that the Boltzmann equation can be integrated using the 
method of characteristics. 

A similar conclusion applies to the correlation functions 
of interest, (£"). These probe information about the distri- 
bution function over the cloud, giving a weighted average 
over many characteristics. Alternatively, if the cloud has 
only a small phase-space diameter, we can exchange infor- 
mation about the entire set of characteristics for the details 
of a single fiducial characteristic and a description of how 
nearby characteristics separate from it. In differential ge- 
ometry this description is provided by the apparatus of Ja- 



1 To be certain that we are estimating only the observables measured by 
a typical observer living within a single terminal vacuum, we should 
demand that p has support only on points whose orbits eventually con- 
verge in some neighbourhood of that vacuum. This requires that all 
horizon volumes reheat almost surely in the same minimum. If some 
horizon volumes reheat in different minima then the resulting correla- 
tion functions are not measurable by a local observer who sees only a 
single vacuum. 



cobi fields; see Fig.rjJ We shall see that the differing imple- 
mentations of the separate universe approximation can be 
understood as alternative methods to compute these Jacobi 
fields. 

In applications we are frequently interested in correla- 
tion functions associated with mixed scales, rather than a 
single scale L. To do so we construct multiple ensembles 
associated with different smoothing scales. The separate 
universe approximation couples the evolution of all these 
ensembles in a specific way, which we describe in fllTTl 

Outline. — In this paper we develop and refine the 
statistical-mechanical interpretation of the separate uni- 
verse picture summarized above. Because the final distri- 
bution of occupation probabilities is an image generated by 
dragging along the phase space flow, it can be calculated in 
precisely the same way that geometrical optics enables us 
to calculate the image generated by a source of light rays. 
In ^n]we show that, at least within the slow-roll approxi- 
mation, this parallel is exact; the scalar field equation can 
be interpreted as the eikonal equation for a light ray in a 
medium with varying refractive index — or equivalently as 
Huygens' equation for a wavefront. 

In S flllGfiTDl we introduce the idea of Jacobi fields 
and explore their connection with the "adiabatic limit," in 
which all isocurvature modes decay and the curvature per- 
turbation becomes conserved. Such limits are important 
because an inflationary model is predictive on its own only 
if the flow enters such a region Ilia. l20l - [22fl . Jacobi fields 
are familiar from the description of congruences of light 
rays in general relativity ||23L I24TI . In this case, as shown 
in Fig. [TJ they describe fluctuations between the L -sized 
patches which make up the ensemble. Their evolution en- 
ables the adiabatic and isocurvature modes to be tracked. 
In particular, decay of isocurvature modes means decay 
of the corresponding Jacobi fields, which occurs when the 
bundle of trajectories undergoes focusing. 

In fllnl we use these ideas to develop evolution ("trans- 
port") equations for each correlation function, and in 3171 
we show that the Jacobi fields can be used to formally inte- 
grate the system of transport equations. This gives a prac- 
tical method to identify regions where the flow becomes 
adiabatic. The analysis can begin from either the separate 
universe principle or traditional cosmological perturbation 
theory. As a by-product, our formal solution demonstrates 
that the transport equations are equivalent to the Taylor 
expansion algorithm introduced by Lyth & Rodriguez. 

In fllVBI we use this solution to derive a closed set of dif- 
ferential equations for the Taylor coefficients, and in 3 IV CI 
we explain how the transport equations can be manipu- 
lated to obtain evolution equations for the coefficients of 
each momentum "shape". These shapes will be an impor- 
tant diagnostic tool when comparing inflationary models 
to observation [25]. Together with the transport hierarchy 
of 3lHl the equations of § 3 IV ByiVCl represent the principal 
results of this paper. Either set can be used to obtain the 
correlation functions of a given theory, and we discuss their 
comparative advantages. 

In SlIVDI we give more a more general discussion of the 



final surface 



Jacobi field 



initial surface 



conserved density p 




conserved density p 



characteristic curves 



FIG. 1: Jacobi fields. Characteristic curves are labelled by arrows, and the red characteristic is the fiducial curve. A conserved probability density is dragged along the flow. 
At any point, the Jacobi fields span the space infinitesimal displacements to neighbouring characteristics. 



relationship between the transport equations and other for- 
mulations of perturbation theory. 

In ^V|we specialize to the slow-roll approximation and 
use ray-tracing methods to derive the gauge transformation 
between field fluctuations and the curvature perturbation, 
f . As a result, we obtain the gauge transformation in terms 
of geometrical quantities — in particular, the extrinsic cur- 
vature of constant density hypersurfaces. We separate the 
gauge contribution to / NL into a number of effects, corre- 
sponding to these geometrical quantities. For some mod- 
els, we show that the largest of these can be attributed to 
a strong relative enhancement of the power in isocurvature 
fluctuations. We briefly discuss what conclusions can be 
drawn regarding the asymptotic magnitude of |/ NL |. 

Finally, we provide a brief summary of our results in SfVil 

Notation and conventions. — We use units in which c = 
ft = 1, and work in terms of the reduced Planck mass, 
M p ~ 2 = 8nG. We use a number of index conventions which 
are introduced in the text. See especially the paragraph In- 
dex convention on p.[5j and the discussion of primed indices 
below Eq. ([31]) on p.fTol 



II. GEOMETRICAL OPTICS IN PHASE SPACE 

Throughout this paper, our discussion will apply to an 
inflationary phase which can be described by a collection 
of canonical scalar fields 4> a coupled to Einstein gravity. 
We initially use Greek labels a, /3, . . . , to label the different 
species of fields. The action for this system is 



d 4 x 



d a <p a d a <p a -2v), (1) 



where V = V[cj) a ) is an interaction potential depending 
only on the scalar fields, and indices a, b, ..., run over 
space time dimensions. We take the background geometry 
to be flat Friedmann-Robertson-Walker with scale factor 
ait). 



Hubble parameter. We introduce an e -parameter for each 
species of light field, 



e„ = 



1 4> 2 

2M p 2 H 2 ' 



(2) 



in terms of which one can write e = ^ a e a . The slow-roll 
approximation therefore entails e„<l. 

Huygens' equation. — Combining (O and the field equa- 
tion for (p a , and making use of the slow-roll approximation, 
we find 



dJV 



±M P 



-M 2 d a \nV, 



(3) 



where d/V = din a measures the number of e-foldings of 
expansion experienced along the trajectory, and d a denotes 
a partial derivative with respect to cj> a . Eq. ([3]) constrains 
the canonical momenta ~ 4> a to lie on a submanifold of the 
classical phase space coordinatized by the fields a . This 
simplification is a consequence of the slow-roll approxima- 
tion. In a theory with M scalar fields, it implies that we may 
work with the simpler M-dimensional field space instead of 
the full 2M -dimensional phase space. This is convenient, 
although when we later abandon the slow-roll approxima- 
tion we will have to return to phase space. 

In what follows we often rewrite ([3]) in the form 

d(p a . V 

—— = u a , where u a = -M;dJ and / = In — , (4) 

aN V^ 

and interpret the solution <p a {N) as an integral curve of the 
vector field u a , parametrized by JV. The scale V* is arbitrary. 
Since / is a gradient, these integral curves correspond to 
pure potential flown 

The unit vector parallel to u a is 



where we have defined 



M P v' 



V2e. 



(5) 



(6) 



A. Slow-roll approximation: rays on field space 

In this subsection we impose the slow-roll approxima- 
tion. This requires e = —H/H 2 <C 1 where H = a/ a is the 



; Since d//d/V = u a d a f = -\\u a \\ 2 /M^, it follows that / is monotone 
decreasing along each integral curve. Therefore one may loosely think 
of / as a Lyapunov function (or Morse function) for the flow. 



It follows that the arc length along an integral curve, la- 
belled s and measured using a flat Euclidean metric on field 
space, satisfies ds = M P vdiV. Reparametrizing each curve 
in terms of s, the flow equation Q can be rewritten 



d0 o 



v =-M P d a lnV = d a S, 

ds 



(7) 



where S = —M P f is Hamilton's characteristic function. 
Eq. (0 is Huygens' equation. Under the assumptions of ge- 
ometrical optics, it describes the propagation of a light ray 
in a medium of spatially varying refractive index v. 

SnelPs law. — We conclude that the inflationary trajecto- 
ries in field space are precisely the light rays of geometri- 
cal optics, for which Huygens' equation can be thought of 
as a generalization of the Ibn Sahl or Snell-Descartes law. 
The wavefronts correspond to level sets of the characteris- 
tic function S and are therefore equipotentials, or surfaces 
of constant energy density in field space. Each light ray is 
locally orthogonal to these surfaces, so the vector h a is lo- 
cally the unit vector normal to a surface of constant energy 
density. 

When slow-roll is a good approximation v is small, v <C 
1, and increases to v ~ 0(1) near the end of inflation. 



B. Rays on phase space 

In some circumstances the slow-roll approximation is not 
available. This may be the case during inflation if slow- 
roll is temporarily violated — perhaps during a turn in field 
space, to be studied in 9VBJ — or on approach to the end of 
inflation, where e ~ 1. 

In such cases we must return to the full second-order 
field equation, which cannot be written in the form of 
Eq. ([3]). To obtain an analogue of geometrical optics one 
must pass to a Hamiltonian formalism. We define 



d<ftq 

dJV ' 



(8) 



This plays the role of Huygens' equation for <fi a . In terms 
of p a , the scalar field equation becomes 

w =[e(p) - 3]p «-YK^r- (9) 

We must also rewrite e and H in terms of p a , obtaining 



H 



H{<t>,p?Ml 



PaPa 
2M p 2 

3-e(p)" 



(10a) 



(10b) 



Note that e is purely a function of p a , whereas H is a func- 
tion of both 4> a and p a . 

Eqs. dS])-© show that, beyond slow-roll, the precise 
analogy with Huygens' equation is lost. Although these 



equations define a congruence of rays in phase space, it 
is not possible to find a characteristic function S so that 
these rays are everywhere orthogonal to equipotentials of 
S. Such a function would have to satisfy 8^ S = p a , and 
therefore S = p a (j) a + g(p) for arbitrary g. Unfortunately, 
there is no choice for g which reproduces the right-hand 
side of Eq. $9$. 

The majority of our analysis requires only the first-order 
evolution equations I©-©, and at this level the formalism 
we develop will apply to evolution in phase space with- 
out imposing slow-roll. For that purpose it is convenient to 
combine cf> a and p a into a single phase-space coordinate. 
We continue to write this a , with the understanding that 
a now ranges over the 2M dimensions of phase space. The 
velocity vector is likewise u a . 



C. Jacobi fields and beam cross-sections 

To proceed, we must carry the initial distribution of oc- 
cupation probabilities along the flow, forming the "image" 
distribution of interest. In optical language, our task is to 
understand how images generated from a source of light 
rays are distorted by passage through a medium. 

It was explained above that the typical spacing between 
arbitrarily selected members of the ensemble should be 
roughly of order the quantum scatter, a ~ (5cj) 2 ) 1/2 . Be- 
cause cr/Mp ~ 10~ 5 <C 1, this is small in comparison with 
the natural scale M P . Therefore the orbits traversed by the 
cloud trace out a narrowly-collimated spray or "bundle" 
of light rays in phase space. In canonical models of in- 
flation, setting initial conditions near horizon-crossing will 
make the initial profile close to Gaussian pal . Therefore 
the evolution of the ensemble is similar to the evolution 
of tightly-focused Gaussian laser beam propagating in an 
optical cavity. 

Connecting vectors. — Cross-sections within the laser 
beam may be focused, sheared or rotated by refraction. 
These possibilities are familiar from the study of weak grav- 
itational lensing. 

To obtain a quantitative description we slice the laser 
beam open, generating a cross-section. The precise slic- 
ing is arbitrary For applications to inflation we will often 
slice along surfaces of fixed energy density, or after a fixed 
number of e-folds. Distortions of the cross section can be 
studied if we know how an arbitrary basis is transported 
from slice to slice. In general relativity this would be Fermi- 
Walker transport [HI] . 

Jacobi used this method to study geodesic deviation on 
Riemannian manifolds. For this reason an infinitesimal vec- 
tor propagated along the beam is called a Jacobi field. Tak- 
ing 54> a to be such a field and the flow vector u a to be 
sufficiently smooth, it will be transported by the equation 



c\N 



5 4>p 3 (S u a= u al3 5 <t>l3- 



(ID 



The quantity u a p — dpu a is the expansion tensor. It can 
be expanded in terms of a dilation 9 = tru a|3 , a traceless 



symmetric shear a a p and an antisymmetric twist a> a/3 , 



= I 5 ° 



+ °aP+U> 



aji> 



(12) 



where d = M for flows on field space, or d = 1M if we do 
not impose the slow-roll approximation and work on the 
full phase space. In either case 5 a p is the Kronecker 5. 

Optical scalars. — Dilation describes rigid, isotropic rescal- 
ing of 5cj) a by 1 + 9. It represents a global tendency of 
the light rays to focus or defocus. The shear a a p is a sym- 
metric square matrix and can therefore be diagonalized, 
yielding d eigenvalues £j and corresponding eigenvectors 
s ai representing the principal shear directions (here i is a 
label taking values 1, . . . , d; see 9VBD . The shear describes 
a rescaling of the component of the connecting vector in 
the direction s ai by a factor 1 + £j. Tracelessness of cr a/S 
implies ^. £j = 0, so expansion in one direction must be 
accompanied by contraction in another. Therefore shear 
preserves cross-sectional area. Finally, the twist co aB de- 
scribes a rigid volume-preserving rotation of 5(j) a , repre- 
senting a tendency of neighbouring trajectories to rotate 
around each other. 
It is useful to define ct 2 to satisfy 



a = —o 



a§ u a 



(13) 



Imposing the slow-roll approximation and working on field 
space, the flow is orthogonal to equipotentials of Hamil- 
ton's characteristic function. Therefore it is a pure poten- 
tial flow, for which co a p = 0. On the full phase space this 
property is lost and the twist can be non-zero. In such cases 
it is helpful to define 2co 2 = co a pco a p. Together, 9, a 2 and 
co 2 comprise the optical scalars introduced by Sachs and 
Penrose [|27tl . 

van Vleck matrix. — Eq. ( fTTb has a well-known formal solu- 
tion in terms of an ordered exponential II28IJ. T his method 
was used Rigopoulos, Shellard & van Tent [12911 . and later 
by Yokoyama et al. [ 7] - It yields an explicit (but formal) 
expression for transport of any Jacobi field along the beam, 



50 a (JV) = r a „(N,N o )50„(JV o ), 



(14) 



where 50«(N O ) is the Jacobi field on some initial slice N = 
N . Eq. ( fl4l ) describes the evolution of this Jacobi field at 
any later time N. The matrix r a/3 (JV, JV ) satisfies 



r a/ ,(JV,JV o ) = 0»exp u ap {N')dN', (15) 

J N 

where the path-ordering operator g? rewrites its argument 
with early times on the right-hand side, and later times on 
the left. We will occasionally refer to T a0 as the propagator 
matrix. It is closely related to a Wilson line. 

Index convention. — Eq. (TTSb can be simplified with the aid 
of an index convention. Up to this point we have been la- 
belling field-space indices using Greek symbols a, fi, etc. To 



avoid writing the time of evaluation explicitly, we adopt the 
convention that Greek indices denote evaluation at the late 
time of interest, N. Latin indices i, j, etc., denote evalua- 
tion at the early time N . Therefore T can be written as a 
mixed index object, T ai . 

Eq. ( fT4] l immediately implies 






(16) 



and endows this derivative with a geometric interpretation. 
It plays an important role in the Lyth-Rodriguez implemen- 
tation of the separate universe approximation [J5[], where it 
appears due to a Taylor expansion in the initial conditions 
local to each L -sized patch. In this formulation, one often 
projects on to equipotential surfaces in field space. We de- 
fine h a n = d (j) c I d (p b to obtain 



8_K 



: hnaTtt 



(17) 



The notation 'c' indicates that c\cfi c a can be thought of as 
the variation of a field d>i defined on a fixed comoving 
spacetime hypersurface Il30l. l3lfl . It follows from geomet- 
rical aguments that h a p = 5 a p — n a fip, where n a is the 
unit normal to phase-space slices of constant potential en- 
ergy, defined in ([5]) . The tensor h a p is the induced metric 
(or "first fundamental form") on these surfaces. Eq. ( fTTI ) 
shows that choice of gauge is associated with projection 
onto an appropriate hypersurface in phase space. More- 
over, Eqs. JT6] >-(fT7l) show that partial derivatives with re- 
spect to (£,- are associated with propagation of Jacobi fields 
along the bundle. 

Caustics. — The matrix F ai appears whenever it is necessary 
to track the distortion of a line element along a flow, and 
has applications in fluid dynamics, general relativity and 
elsewhere |H1,[32]. DeWitt-Morette observed that, consid- 
ered as a matrix of Jacobi fields, Eq. JT6] ) was related to the 
inverse of the van Vleck matrix, introduced in the construc- 
tion of semiclassical ("WKB") approximations to the path 
integral |J33TlI^We define 



5 W g?exp\ - 




(JV') dJV' 



(18) 



3 In DeWitt-Morette I53l the proof is ascribed to B.S. DeWitt. DeWitt- 
Morette noted that the relation between Jacobi fields and variation of 
a general solution of the field equations with respect to its constants of 
integration had been known to Jacobi (ultimately leading to his devel- 
opment of what is now Hamilton- Jacobi theory), and suggested that 
this technique could be used to simplify the long calculations which 
arise when solving Jacobi's equation. Applied to inflationary correlation 
functions, the history has been reversed: the variational formulae came 
first, in the form of the Lyth-Rodriguez algorithm. This often leads to 
simple analytic results, as DeWitt-Morette foresaw. But, as we explain 
in 3IVBI this method is unsuited to numerical implementation, because 
of the small numerical tolerances required to reliably determine varia- 
tion with respect to the initial conditions. It is preferable to solve an 
ordinary differential equation, such as Jacobi's equation Hilt or 1251 . 



The van Vleck matrix is A ia = (JV - N ) d F~^, and has a 
well-known interpretation in geometrical optics as a mea- 
sure of focusing or defocusing: in particular, |detA| — » oo 
at a caustic, where light rays converge. Since (JV — JV ) is 
nonzero for N ^ N Q , a singularity in the van Vleck deter- 
minant implies a singularity in detT -1 . Applying (fT2l) . we 
conclude 



T = detr=9(JV,JV ) = exp I 0(N')dN'. (19) 



detr 



Therefore 6 — » at a caustic. This happens after finitely 
many e-folds only if 9 — > — oo during the flow. Otherwise, 
6 is decreasing in regions where 6 is negative, with large 
negative 9 implying strong focusing. Large positive 9 im- 
plies strong defocusing. More generally the propagator ma- 
trix can be rewritten in terms of 6, giving 



r a! = B(JV, JV ) 1/M 9 exp (a + <o) ap (N') W Spi. 

(20) 
The ordered exponential has determinant unity and there- 
fore does not change the cross-sectional area of the bundle. 



D. Adiabatic limit 

Caustics have an important interpretation in the flows 
describing an inflationary model. If the bundle of trajec- 
tories has finite cross section, then the ensemble contains 
members which are evolving along multiple phase space 
trajectories. These are the eponymous "separate universes" 
with their individual initial conditions. 

Under these circumstances one or more isocurvature 
modes exist. These are connecting vectors which relate the 
different <fr a within the bundle which all lie on a surface of 
fixed energy density, say S p . Their number is determined 
by the rank of h a pTp i . In the special case where the bun- 
dle cross-section decays to a point, there is a unique in- 
tersection between the bundle and S p . Therefore h a aTai 
has rank zero and all isocurvature modes disappear. In this 
limit, each member of the ensemble traverses the same or- 
bit, differing from the others only by its relative position, 
which corresponds to the adiabatic mode, £. It follows that, 
when the cross-section collapses to a point, the fluctuations 
become purely adiabatic. Elliston et al. llia l described this 
as an 'adiabatic limit'. After this limit has been reached f is 
conserved il34ll35tl . 

Flows which reach an adiabatic limit during inflation are 
no more or less likely — or natural — from the viewpoint of 
fundamental physics. But flows reaching an adiabatic limit 
are more predictive, because a perturbation in the purely 
adiabatic mode remains adiabatic long after inflation ends 
[|36fl . even during epochs for which we are ignorant of the 
relevant physics. Contrariwise, if any isocurvature modes 
remain then members of the ensemble may rearrange their 
relative positions until these modes decay. This possibility 
was emphasized by Meyers & Sivanandam [13711 : see also 



Ref. rflSTI . If the flow does not reach an adiabatic limit 
during inflation then the model is not predictive until we 
supply a prescription for the post-inflationary era, and ob- 
servational predictions can depend on this choice. 

Trivial, adiabatic and nonadiabatic caustics. — The out- 
come of this discussion is that approach to an adiabatic 
limit can be associated with convergence to a caustic. An 
early discussion of this principle, phrased almost precisely 
in these terms, was given by Wands & Garcia-Bellido [12111 . 
We conclude that 9 — » is a necessary condition for an 
adiabatic limit to occur, but as we now explain it is not 
sufficient^ A caustic can be classified by the number of di- 
mensions lost by the flow, or equivalently the number of 
null eigenvalues of the propagator T ai at the caustic. An 
adiabatic limit is the special case where T ai retains a single 
non-null eigenvalue, but h^T^ has no non-null eigenval- 
ues. We describe caustics which satisfy this condition as 
adiabatic. 

Eq. J20l shows that, were the integrated shear and twist 
to remain bounded while — > 0, then r ai — » 0. In this 
case no perturbations would survive, and we describe the 
caustic as trivial. An example is the case where u ap > is pure 
dilation. But barring an accurate cancellation of this kind, 
at least some component of (cr + o>) ap - will typically scale 
proportionally to 9 on approach to the causticjj 

Shear opposes focusing. — If the perturbations are not to van- 
ish completely, then some anisotropic effect of shear and 
twist must oppose the isotropic contraction due to — » 0. 

First suppose the twist is negligible. We assume that the 
eigenvectors of a stabilize in the vicinity of the caustic. If 
the shear has some number of positive eigenvalues A ; for 
which X { /9 has a finite, nonzero limit, then perturbations 
may survive in the subspace spanned by their correspond- 
ing eigenvalues. Tracelessless of cr implies that at least one 
eigenvalue must be negative, and perturbations in the sub- 
space spanned by the corresponding eigenvectors will dis- 
appear. Hence, at least one dimension will be lost by the 
flow. In practice it is often simpler to work directly with the 
eigenvalues of the expansion tensor u a/3 . 

If more than one eigenvalue of a is positive, then pertur- 
bations may survive in a two- or higher dimensional sub- 
space. In this case the caustic does not describe approach 
to an adiabatic limit, and we call it nonadiabatic. To obtain 
predictions for observable quantities the evolution must be 



4 One may have some reservations about this conclusion, because it 
seems to violate the Liouville theorem which guarantees conservation 
of phase-space volume. However, it should be remembered that the 
canonical phase space coordinate to which Liouville's theorem applies 
are not the field-space position and momenta which we are using. In 
particular, the canonical momenta will typically include powers of the 
scale factor a. 

5 In principle u a p could contain off-diagonal terms which grow faster than 
the diagonal terms, and therefore 9 . In this case there could be a sub- 
space of growing perturbations. If the growth is exponential this usually 
signals an instability, and the formalism we are describing becomes in- 
valid. 



continued. In practice this would require introduction of 
a reduced phase space describing only the surviving per- 
turbations. The flow can then be followed in this reduced 
phase space until a further focusing event occurs. This may 
itself be an adiabatic limit, or might simply describe further 
reduction in the phase space. One should continue in this 
way until an adiabatic limit is finally achieved. An exam- 
ple of this behaviour could occur soon after the onset of 
slow-roll inflation. In the early stages, independent fluc- 
tuations in the field velocities survive. But when slow-roll 
is a good approximation these will be exponentially sup- 
pressed, making 6 become very small. One should there- 
fore replace the full description by a reduced phase space 
which includes only field perturbations. In doing so one 
arrives at the field-space description of slow-roll inflation 
given in 311 Al 

Twist opposes focusing. — In slow-roll inflation, which we 
discuss in 911 El below, a diverging shear is the only mecha- 
nism by which perturbations can survive on approach to a 
caustic. Where the twist is non-zero, which occurs when we 
do not impose the slow-roll approximation, more possibili- 
ties exist. Ultimately these must be addressed, to describe 
approach to an adiabatic limit when slow-roll is no longer a 
good approximation, but we defer this discussion for future 
work. 

I 



E. Focusing in the slow-roll approximation 

In this subsection we give a more detailed discussion of 
the approach to a caustic during an era of slow-roll infla- 
tion. 

Raychaudhuri equations. — Parametrizing each trajectory 
by e-folding number N, Eq. ([4]) constitutes an autonomous 
dynamical system. Therefore a derivative along the flow 
can be written d/dJV = u a d a . In the absence of a nontrivial 
field-space metric all derivatives commute, and therefore 
[d Y , dp]u a = 0. Contracting with u Y and rearranging terms, 
one finds 



du 



aji 



AN 



dp a a ~ u a r u rP> 



(21) 



where a a is the acceleration vector, defined by a a = 
du a /dN = UpdpU a . For a potential flow, this can be sim- 
plified; comparison with Eq. ([4]) shows that 



Mi 



■dy 



(22) 



where, as above, v is the local refractive index. 

The evolution equations for the dilation and shear can 
be written 
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(23a) 
(23b) 



These are commonly known as the Raychaudhuri equa- 
tions. An equation for the evolution of the twist could be 
found in the same way, but is not needed in the slow-roll 
approximation. 
We have defined M , aS to be the Hessian of v 2 , 



^ a p= 2 d * d P v2 > 



(24) 



and #€ is its trace. Because the Hessian measures the local 
curvature of a function, one can regard #€ a p as a measure 
of the curvature of surfaces of constant refractive index in 
field space. 

Jacobi equation. — Eq. ( fTTT) shows that Jacobi fields ori- 
ented along eigenvectors of u a/3 with positive eigenvalues 
grow, whereas those oriented along eigenvectors with neg- 
ative eigenvalues decay. 

We can find an alternative description in terms of the re- 
fractive index v. Taking a derivative of (fTTT) along the flow 
and using the Raychaudhuri equations to eliminate deriva- 



tives of the dilation and shear yields the Jacobi equation^] 
d 2 5cj> a 



dJV 2 



■■M$W ap 54> p . 



(25) 



It follows that the behaviour of the Jacobi fields is de- 
termined by the curvature of v 2 , considered as a func- 
tion in field space. (Note this is related to, but not the 
same as, the curvature of surfaces of constant v.) Qualita- 
tively, Jacobi fields oriented along eigenvectors of #€ a p with 
negative eigenvalues — directions of negative curvature — 
will have quasi-trigonometric solutions. These will pass 
through zero, corresponding to the collapse of some Jacobi 
fields to zero length. Fields oriented along eigenvectors 
with positive eigenvalues will have exponential solutions. 



6 When using Jacobi fields to study geodesic deviation on a Rieman- 
nian manifold, this equation takes the form 5<$> a = -R a h(Sn& <t> p > where 
R-anfih = " p " CT ^ap/?o- ls a component of the Riemann curvature pro- 
jected along the tangent to the geodesic. 



Unless the initial conditions are precisely adjusted, these 
will typically grow. 

Focusing theorem. — By adap ting the geodesic focusing 
theorem of general relativity 112311 we can determine the 
circumstances under which focusing will occur after finitely 
many e-folds. Pick a point on the flow where the expansion 
is negative, with value 9+ < 0. Inspection of (123aD shows 
that, if #e < 0, then 6 -> -oo within AJV = M/|0J e-folds, 
where M is the dimension of field space. Any point where 
6 = —oo is a caustic, because on arrival at this point 8 = 0. 

Since Morse's lemma implies that J*? is negative in a 
neighbourhood of any local maximum of the refractive in- 
dex, v 2 = 2e, one might hope to associate such local max- 
ima with terminal points for inflation at which an adiabatic 
limit would be nearly achieved. 

However, the conditions of the focusing theorem are not 
satisfied for typical potentials. More usually the slow-roll 
approximation forces all fields to settle into a terminal vac- 
uum increasingly slowly, requiring an infinite number of 
e-folds to reach 6 = 0. Moreover, in practical examples the 
slow-roll approximation will break down and inflation will 
terminate long before the caustic is reached. Therefore we 
should not expect to achieve precisely 9 = during infla- 
tion. Nevertheless, a model may be sufficiently predictive if 
the flow spends enough e-folds in a region of large negative 
8 that 8 is exponentially suppressed before inflation ends. 

In simple potentials it is often clear when £ ceases to 
evolve. But for more complicated potentials the situation 
may not be so clear. Within the slow-roll approximation, 
this discussion shows that > 1 can be taken as a clear 
indication that isocurvature modes are still present. Their 
future decay is likely to influence £ and the outcome of 
any calculation which terminates with 6 > 1 should not 
be considered a prediction for observable quantities. Con- 
versely, 9 <C 1 is an indication that some decay of isocurva- 
ture modes has taken place. The precise nature of the de- 
cay must be deduced from the behaviour of the shear and 
twist. If perturbations survive only in a one-dimensional 
subspace than we can infer that the isocurvature modes 
have decayed to the point that £ will be approximately con- 
served. 

Example: quadratic Nflation. — We illustrate these ideas 
using the quadratic approximation to Nflation 11381. 139 
The potential is 



On the inflow trajectory, the dilation satisfies 



V 



2 2 



~<+l 



(26) 



This model is of interest in its own right, but also describes 
the approach to a generic stable minimum after suitable 
choice of field space coordinates. We suppose that there 
is at least a modest hierarchy among the masses, and or- 
der these so that m a < nip if a < [3. The most massive 
field will settle into its minimum first, followed by the next 
most massive field. Therefore approach to the final mini- 
mum will be described by a trajectory on which only cf> 1 is 
dynamical, with all other 4> a approximately zero. We de- 
scribe this as the "inflow" trajectory. 



linf , 




(27) 



The minimum cj) 1 = is a caustic, but as discussed above 
it cannot be reached after finitely many e-folds (within the 
slow-roll approximation) . The expansion tensor satisfies 
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(28) 
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This has one positive eigenvalue and the rest negative, so 
we expect it will correspond to an adiabatic limit. 

The (1,1) component of r inf diverges near the caus- 
tic. This does not signal an instability, but only that <5</> : 
grows at precisely the required rate to give constant £ ~ 

Ordered exponentials such as (TTSb satisfy a composition 
property, allowing the integral over the inflationary trajec- 
tory to be broken in two. (See Fig. [2]) The first component 
is an integral from the initial point until the onset of the 
inflow trajectory. We take this to occur at (f> 1 = (p\, and 
choose (f>* so that (1281) is a good approximation there. The 
propagator at this point is r* . . It is a complicated weighted 
average over the trajectory, and cannot usually be calcu- 
lated analytically. The second component is an integral 
over the inflow trajectory, which we denote r 1 ^. There- 
fore r ai = (r inf r*) ai . The inflow part can be computed 
from 



4>i 



->inf , 
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(29) 
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Except perhaps for special choices of initial conditions, 
Eq. ( [291 gives rank r = 1 at the caustic. Therefore this 
is an example of an adiabatic caustic. 

In more general circumstances, it may be necessary to 
diagonalize u 1 ^ before integrating over the inflow tra- 
jectory. This is reminiscent of the introduction of scal- 
ing operators in a renormalization-group framework. In- 
deed, the entire analysis, and the emergence of rational 
but non-integer power-law scaling near the caustic, paral- 
lels a renormalization group analysis in the neighbourhood 
of a fix ed p oint N40T 1 ; compare also Eq. (79) of Vernizzi & 
Wands [|30fl . 



initial point 



• focus point cpj =0 

FIG. 2: Decomposition of propagator along an inflationary trajectory. Trajectories flowing into the minimum from most initial points join an "inflow trajectory" (represented 
by a dashed line) at 4> 1 = $ { ■ The precise location of the junction is initial-condition dependent. The inflow trajectory sinks into the caustic, which here is a focus 
point, giving nearly-universal behaviour in the final stages of approach. This parallels the discussion of universality in critical phenomena; however, here, the uni- 
versal region is often physically inaccessible because the slow-roll approximation breaks down in the vicinity of the focus point. The remaining part of the trajectory 
(represented by a solid line) is non-universal, and typically cannot be calculated analytically. 



Focusing in double quadratic model. — Away from the in- 
flow trajectory it is usually necessary to proceed numer- 
ically. In Fig. [3] we show the evolution of the focusing 
parameters in the well-studied model of double quadratic 
inflation (H^H [H, [30I EH 51] . The potential is V = 
m 2 (p 2 /2 + m 2 <f> 2 /2. We choose the mass ratio m 1 /m 2 = 9 
and set initial conditions 4> 1 = 8.2M P and (p 2 = 12.9M P . 

Initially the evolution is mostly in the <p 2 direction. When 
4> 2 reaches the vicinity of its minimum there is a turn in 
field-space, which generates a spike in / NL . After the turn, 
the inflow trajectory is reached along the (p 1 direction. 

This evolution is reflected in the evolution of the bun- 
dle. Initially 9 > and the cross-section slowly dilates. 
It reaches a maximum at roughly three times the origi- 
nal cross-sectional area. After the turn, 9 rapidly drops 
to a negative value, and thereafter diverges exponentially 
to —oo. Therefore the bundle-cross section very rapidly di- 
minishes to almost zero cross-sectional area. This corre- 
sponds to an approximate caustic, and leads to an adiabatic 
limit. 

Eventually the divergence in 9 would be cut off by a 
breakdown of the slow-roll approximation, but for typical 
parameter choices 8 will already be exponentially small at 
this point. 

Example: axion-quadratic model. — Elliston et al. 111811 intro- 
duced an approximation to the hilltop region of axion N- 
flation 11391 . The Hubble rate is dominantly supported by 
many axions in the quadratic region of their potential, and 
can be approximated by a single field. A few axions remain 
in the vicinity of the hilltop, where their contribution to H 
is negligible but their contribution to the three- and higher 
n-point functions in the adiabatic limit is large. 

The potential is V = m 2 <t> 2 /2 + A 4 (l - cos2nx/f). We 
set A 4 = 25m 2 f 2 /4n 2 and choose / = M P . In Fig. [4] we 



show the evolution for initial conditions <p 
* = (//2-0.001)M P . 



16M P and 



The evolution is similar to the double quadratic model. 
Initially 9 is positive and the cross-sectional area grows. 
At its peak, it is more than 200 times the original cross- 
section. Eventually <$> approaches its minimum and the 
Hubble friction decreases to the point that % can evolve. 
It rolls away from the hilltop, eventually ending inflation. 
During this phase 9 switches sign, ultimately diverging ex- 
ponentially to —oo. Therefore we approach an adiabatic 
limit. However, Fig. [4c] shows that the rate of approach is 
quite slow. The cross-section decays softly, and by the end 
of inflation ~ 10~ 3 . Therefore an approximate adiabatic 
limit is reached and we can expect the observables to be 
roughly conserved through the post-inflationary evolution. 



III. TRANSPORT EQUATIONS 

We now apply these ideas to obtain evolution (or "trans- 
port") equations for the correlation functions in a fixed, 
comoving spacetime volume. In this section our analysis 
will be general, and can be applied to any perturbations 
whose evolution equations can be expressed in the form of 
Eq. (f3lT) . If necessary this can be achieved as described 
in |II] by passing to a Hamiltonian framework. It follows 
that the transport of correlation functions is most naturally 
expressed in phase space. 

Connecting vectors. — Consider the set-up described in 3D 
in which a comoving spacetime region of size \x is smoothed 
into separate universes of size L. Pick any one of these L- 
sized regions, which we take to be at spatial position x. 
The separate universe approximation asserts that the evo- 
lution of the smoothed fields in this region is given by the 
flow equation ([4]). We denote the difference between these 
values and those in some other region, located at position 
x + r, by 5</> a (r). This is a connecting vector in the sense of 
Eq. (fTTTl . Taylor expanding u a , the corresponding deviation 
equation is 



d50 g (r) 
diV 



u ap [4>{x)]5cj) p (_r) + -u a/3r [<Kx)] {5<^(r)50 r (r) - (5^(r)5<^ r (r))} + • • 



(30) 
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(a) Dilation ( 



fb) Integrated dilation j 8dN 



(c) Focusing © = e 



JfldN 



FIG. 3: Dilation, integrated dilation and focusing parameters in the double quadratic inflation model V = }rm\§\ + \m\<&\. The mass ratio is m 1 /m 2 = 9, and the initial 
conditions are <p 1 = 8.2M P , 4> 2 = 12.9M P . All plots are against the e-folding number N, measured from horizon exit of the mode in question. 
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fb) Integrated dilation j 6dN 
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FIG. 4: Dilation, integrated dilation and focusing parameters in the axion-quadratic model V = ^m 2 (j> 2 + A 4 (l — cos2nx I f)- We have set A 4 = 25m 2 f 2 /4tz 2 and / = M P 
The initial conditions are 4> = 16M P , x — C//2 - 0.001)M P . In (c), the inset panel shows the evolution of near the end of inflation. All plots are against the 
e-folding number N, measured from horizon exit of the mode in question. 



We assume (5</> a (r)) = and have subtracted a zero-mode to preserve this throughout the motionQ The tensor u a p 
was defined in ( fTTT) . and u a p r = d y u a p. We describe them, together with higher-index counterparts obtained by further 
differentiation, as u-tensors. They inherit a dependence on x through evaluation at <p a = <£ a (x). After transformation 
to Fourier space, the subtractions in Eq. (f30l correspond to discarding disconnected correlation functions. Therefore 
statistical properties of the ensemble do not depend on our choice of fiducial point. 

If ju/L is not superexponentially large, we can typically expect |5$ a (r)| to be small and slowly varying. In Fourier space, 
this implies that 5$ a (r) is constructed from only a few soft, infrared modes which we label k. The remaining modes 
have been integrated out in the smoothing process used to obtain this effective, separate-universe description. Working 
explicitly in terms of these modes, Eq. J30l yields a connecting vector and deviation equation for each combination of 
species and k-modaj 



d5cp a > 1 f 1 

: u a ,p(x)5<t> pl + -u a , p y(X) [54> p ,54> f - (54> p ,54> f )l + ■ 



diV 



(31) 



Eq. (|3T1) has been written in an abbreviated "de Witt" nota- 
tion, in which the primed, compound index a' carries both 
an unprimed species (or "flavour") label a and a momen- 
tum k a . Contraction over primed indices implies summa- 



7 In the language of Feynman diagrams, this would correspond to remov- 
ing contributions arising from disconnected pieces. This procedure is 
routine in applications of the separate universe principle. 

8 In Eq. <3H we are keeping nonlinear terms in the evolution equation. 
We use the term 'SJacobi field" to refer to infinitesimal connecting vec- 
tors, for which only the linear term need be kept. 



tion over the flavour label and integration over the momen- 
tum label with measure d 3 fc. The 2- and 3-index u-tensors 
appearing here satisfy 



"a'j3'M- 5< X> _ V U a/3M (32a) 

"a'jsyW - (2n:)" 3 5(k a - k p - k r )u a p r (x). (32b) 

Eq. (1301) was given by Yokoyama et al. |@] in real space, 
and used to obtain evolution equations for the momentum- 
independent Lyth-Rodriguez Taylor coefficients. We ex- 
plore the relationship between our approaches in Ap- 
pendix |A) However, Yokoyama et al. did not interpret 
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u a p as the expansion tensor of the flow or give the lc- 
space equations (f3Tb and H32aD -( [32b1 ). As we will see, 
this k-dependent information is necessary to obtain trans- 
port equations for the full set of coupled k-space correlation 
functions. 

One can arrive at the same conclusions using cosmologi- 
cal perturbation theory. Taking the background value of <p a 
to be the average field over the unsized box, the perturba- 
tions within the box are 50 a (r). One should now interpret 
r as a coordinate relative to the box. Restricting attention 
to the infrared modes in 5<£ a (r), for which k/a is negligi- 
ble, we recover Eqs. ([31]) and ( I52al) -( l32b1) . 

Correlation functions. — The full set of connecting vectors 
contains all information required to determine evolution 
of the bundle, and therefore the evolution of all statistical 
quantities. In Eq. ( f3lT) this data is carried by the u-tensors. 
The transport equations for correlation functions are simply 
a reorganization of this information. Therefore they must 
also be expressible purely in terms of u-tensors. Since (l3~TT ) 
shows that these tensors can be obtained by the separate 
universe argument or traditional perturbation theory, it fol- 
lows that they will make equivalent predictions. 

There are multiple ways to organize the u-tensors to pro- 
duce evolution equations. In Ref. [8i], transport equations 
were obtained after postulating a conservation equation for 
a probability density P, 



As in any effective theory, the transport equations will be 
useful only if a reason can be found to systematically ne- 
glect an infinite number of terms. Applied to inflation, the 
statistical properties of the ensemble are nearly Gaussian: 
in the simplest models, an n-point function will typically be 
of order H m(n) , where m{n) is the smallest even integer at 
least as large as n [45]. This is suppressed compared to 
the natural scale M P by (H/M P ) m( " :) < 1. However, this is 
not necessary; all that is required (or suggested by obser- 
vation) for ( |34l ) to be valid is that the three- and higher n- 
point functions are substantially smaller than the two-point 
function. 

Eq. ( 1341 ) was given in Ref. [@] for an arbitrary n-field 
model, but with E a >g/ interpreted as the real-space cor- 
relation function. The single-field case is discussed by 
Gardiner flU]. With the u-tensors given in fl32aD -( f32b~b . 
Eq. ( [341 ) applies for the full k-dependent correlation func- 
tion. 

Three-point function. — We write the three-point function as 
a a ipi y i = (5<£ a '5(^'5(/y). Keeping contributions of order 
0(S 2 ) and 0(a), we conclude 



da a 'j3y 



x'X' 



a A'/3Y + u a'AV^A'/3'^uY 



dJV ~" A ~" A ^ r ' ' " a ' A M ' M r (35) 

+ cyclic (a' -> j3' -* /) + [> 4 p.f.]. 



dP 



(33) 



Evolution equations for the moments of P were extracted 
using both a Gauss-Hermite expansion, and generating 
functions. Here we describe a third, simpler method. Pro- 
vided the perturbations can be treated classically, we expect 
d<0)/diV = (dO/dJV) for any quantity 00 

Two-point function. — We write the two-point function as 



J a'j8' 



{54> a '8<j)a>}. Eq. (|3TT ) implies 



dN 



d5 4v s , , s . d5( V 



■ U a y2_, r ^. 



+ iifl V E rV + |>3p.f.] 



(34) 



{u,EW + [>3p.£] 



where {A,B} is the matrix anticommutator of A and B, and 
[^ 3 p.f.] denotes terms including three-point functions or 
above which have been omitted. In general, the transport 
equations will couple correlation functions of all orders. 
They can be thought of as a limiting case of a Schwinger- 
Dyson hierarchy, applied to expectation values rather than 
the in-out amplitudes of scattering theory. Calzetta & Hu 
argued that the result could be interpreted as a Boltzmann 

hierarchy [SH]. 



This equation both implies and is implied by conservation of probability, 
Eq. 133) 



In simple models, the scaling estimate (5(f> n ) ~ H m< - n ^ 
makes both terms the same order of magnitude. For (|35T ) 
to be valid requires the 4-point function to be substantially 
smaller, which is also supported by observation 04711 . 



IV EVOLUTION OF CORRELATION FUNCTIONS 

A. Solution of the transport hierarchy by raytracing 

The transport equations ( f34l) and (f35T ) can be solved us- 
ing the machinery developed in 3HJ The key ingredients 
are the phase-space flows which describe evolution of in- 
dividual "separate universes," and the Jacobi fields which 
connect them. The solution is formal and depends only 
on the structure described in 3lHl Therefore there is no 
requirement to impose the slow-roll approximation, and 
when written over the full phase-space our equations apply 
quite generally. When truncated to field-space they repro- 
duce the slow-roll evolution. 

Two-point function. — We write the two-point function 
T,„ibi in the form 



To'i'Tf 



•j'^i'f 



(36) 



where r is to be determined. This notation has been chosen 
because r will turn out to be the propagator matrix (fT5T ) 
for the primed indices. Indeed, J36l ) is a solution of the 
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transport equation J34l l if 

dTV _ 
dJV 

&vy _ 
dJV 



' ^a'x j'i' 



0(H 4 ) s» 0. 



(37a) 
(37b) 



Eq. ( |37aD is the equation for a Jacobi field, Eq. (fTTT ) . 

In writing ( |37bD we have assumed approximate Gaus- 
sianity, so that contributions from higher-order correlation 
functions are suppressed by at least a power of H 2 com- 
pared to the terms which have been retained. Keeping 
these terms would yield the "loop corrections" of the Lyth- 
Rodriguez formalism [flfl. l48l. I49TI . To the order we are 
working, H^y should be identified as a constant: it is the 
value of the two-point function evaluated at JV = N , where 
N is the initial time which appears in the propagator (fTST ). 
We write this constant value &>*. 



The primed propagator satisfies 



5(k a -ki)r ai , 



(38) 



where T ai is the flavour propagator (fTST ). Therefore, writ- 
ten more explicitly, Eq. d36l) becomes 



(5^ a (kJ5^(k^)) = r ai r w <50 i (k a )50 ; -(kp)) o , (39) 

where our usual convention — that Latin indices denote 
evaluation of the correlation function at some initial time 
JV — continues to apply. For the two point function, prac- 
tical calculations usually simplify if this is taken to be the 
horizon-crossing time associated with scale fc = |kj = |k^|. 
We have indicated this by attaching a subscript '0' to the 
correlation function. With this understanding, and recol- 
lecting the identification ([16]), Eq. ([39]) is the familiar "5N" 
result fid. 

Three-point function. — Similar methods can be used to 
solve for the three- and four-point functions. We write 
a a /Riyi = T^iiTaiyTYkiOLiiyki. As for the two-point function, 
the propagator matrices absorb contributions from the u a p- 
tensors. In the case of E^g/ there were no other terms, 
making the "kernel" H^y time independent. Here, the pres- 
ence of terms involving u 3-tensors provides a source for 
a iljT . WefincH 

+ 0(H 6 ), 

where, as above, 5^,- is the initial value of the two-point 
function introduced in (f36t . The estimate 0(H 6 ) for the 
truncation error, beginning with contributions from the 



10 We are allowing a^/j./ to be a function of N, which means our index 
convention must be interpreted more abstractly. The expressions for 
r-matrices to which Eq. 1401 1 leads, such as Eqs. I |49a> -l |49b1 , can be 
interpreted in the original sense. 



four-point function, again assumes that the correlation 
functions order themselves in even powers of H. We de- 
fine the matrix r~\ to be the left-inverse of the propagator, 

r~i,r a /j/ = <5( k ; _ kf)5y. Inspection of ([38]) shows that it 
can be written 



rz 1 , = 5(fc - kjrr 1 , 



I a J ia 



(41) 



where T" 1 is the conventional matrix inverse of the flavour 
propagator, Eq. (TT5T ). In what follows it is useful to define 
a projected u 3-tensor, Uyfy, by 



Ui'j'V ~ r i' a ' u a'p'r' r p'j' r r'k'- 



(42) 



Combining ( 1381) and fl4"TT ), it follows that the explicit k- and 
flavour-dependence can be written 



u i , rk , = 5(k i -k j -k k )u ijk , 



(43) 



where the tensor Uy fc is the obvious flavour projection of 
u ijk , so that u ijk = T-^u a p r rpjT rk . 

With these definitions, Eq. J40l ) can be solved by quadra- 
ture. Up to loop corrections, we find 

rN 

avyv = A'j'k'+ Ui'm'n'(.N')^ m7 y n 'y dJV'+cyclic, (44) 

Jn 

where j^i'y k > should be regarded as the value of the three- 
point function at JV = N . The complete solution can be 
written (again up to loop corrections) 



a a'§'r' — ^a'i'^P'j'^r'k'-^i'j'k' 



+ 



I * a'm'n'^ j3'j 



1 v , ie'<y m , i'^'n , ie' ~r C 



'm'n' 1 j3' }' l Y'k'^'m' j'^n'k' 



yclic ) , 



(45) 



where the cyclic permutations exchange a — » /3 — » y . 

One can regard Eqs. d37aD -( |3"7b"l ) and ( 1441) as analogous 
to the "line of sight" integral which is used to obtain a for- 
mal solution to the Boltzmann equation in calculations of 
the cosmic microwave background anisotropies. 

The quantity T a > m > n , is defined by 



r«w = r a , f s, mv (jv0djv'. 



(46) 



Observe that Eq. ( 1461 ) is symmetric in the indices m' and 
n' . With our choices for the k- and flavour-dependence of 
its constituent quantities, it can be written 



* a'm'n' 0[_K- a K m K n Jl amn , 



(47) 



where T amn is the flavour-only object obtained by exchang- 
ing primed for unprimed indices in J46] ). Comparing 
with (fTST ). it follows that (up to matrix ordering ambigu- 
ities) T amn is the derivative of the propagator, 



d4> m d4> n 



(48) 



Eq. ( 1451 ) can now be recognized as the Lyth-Rodriguez for- 
mula for the three-point function J5D . 
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B. Flow equations for "5N" coefficients 

We conclude that the transport equations ( [341 ) and (135T ) 
are equivalent to the Taylor expansion algorithm of Lyth & 
Rodriguez for the three-point function. Also, because the 
u-tensors could equally well be derived using the methods 
of cosmological perturbation theory, all these methods will 
give answers which agree. Within this narrow reading, our 
analysis can be interpreted as a demonstration that these 
methods are interchangeable. Therefore we believe that 
statements to the effect that any particular method cur- 
rently in use has an intrinsic drawback when compared 
with another, as a matter of principle, are wrong. 

Nevertheless it is true that some approaches have advan- 
tages in practice, although no one approach outperforms 
the others in all applications. For example, as explained in 
3B in some models the Taylor expansion algorithm leads 
to very simple analytic formulae. This property has en- 
couraged a large literature studying models to which the 
method can be applied. 

In this broader context our analysis is not simply a re- 
formulation of existing results. First, as a byproduct of the 
raytracing method we have obtained explicit (but formal) 
expressions for the Lyth-Rodriguez Taylor coefficients, 

d J± = r ai = 9 exp I J u ap {N') dJV' 5 pi (49a) 






(49b) 



Analytically, the Taylor expansion method is useful only 
when a solution to Q49aD can be found in closed form. 
This has been achieved only for a limited class of poten- 
tials obeying some form of separability criteria; a summary 
appears in Ref. [18] together with references to the origi- 
nal literature. Eq. d49aD clarifies the difficulty encountered 
in obtaining analytic formulae as the difficulty of comput- 
ing closed-form expressions for a path-ordered exponen- 
tial. A sophisticated theory is available [|50l l but explicit 
expressions can usually be obtained only in special cases, 
or where the expansion tensor commutes with itself at dif- 
ferent times. It is possible that Eq. Q49aD could be used to 
extend analytic progress beyond the separable cases, but 
we have not investigated this possibility in detail. 

Eqs. Q49aD -( l4"9"b1 ) were given, in slightly different nota- 
tion, by Yokoyama et al. [7]. Because of its close rela- 
tion to the present discussion we review and extend the 
Yokoyama et al. approach in Appendix |A) 

Second, a naive numerical implementation of the Taylor 
expansion formula is unfavourable. Beginning with frac- 
tionally displaced initial conditions one must evolve the 
equations of motion over many e-folds, during which nu- 
merical noise is accumulating. Taking differences between 
these evolved solutions requires high-accuracy integration 
in order that the small displacement in initial conditions is 
not swamped by noise. The explicit solutions J49a| )-( [49bl > 
allow this naive approach to be replaced by a simple sys- 



tem of ordinary differential equations for T ai and r a y. The 
r ai equation is the Jacobi equation d37aD , after dropping 
primes on indices. The initial condition is T ai = 5 ai . The 
T ai; equation can be obtained by differentiation of j49bD . 
It is 



dr„ 



dIV 



jF/3£; + u apy^pi^rJ' 



(50) 



with initial condition r a y = 0. 



The same approach can be applied systematically to de- 
duce transport equations for any of the Taylor coefficients. 
Yokoyama et al. wrote the transport equation Q37aP for 
T ai , but did not write ( p50l for T ai j which they computed 
directly from J49bp . See Appendix |A| for a comparison. 



C. Transport of "shape" amplitudes 



The results of 9IVAI apply for arbitrary initial conditions 
S^i'f, J^i'j'k' f° r tri e two- and three-point functions. But for 
application to inflation, we will usually wish to apply them 
to the correlation functions produced in a specific model. 
In this case the fields a will be a collection of light scalars 
for which ^y and j^-y/fc' can be computed using the in-in 
formulation of quantum field theory [|26f1 . These yield very 
specific k-dependences whose amplitudes we wish to track. 

In this section, our analysis remains general and contin- 
ues to apply to the full phase space. 

Two-point function. — The two-point function is straight- 
forward. For a nearly scale-invariant spectrum we have 



] ar = (2 7 r) 3 5(k a +k /3 )-^f, 



(51) 



where k = |k a | = |k^| and the flavour matrix E a/3 should 
be nearly independent of k. Transport of £ a ^ can be ac- 
complished using ( [39] ), or simply by solving the transport 
equation J34l ) with an appropriate initial condition after 
dropping primes on indices. That gives 

^a/3 = ^ai^Pj^ij' (52) 

where 5^ is the initial value of S aj3 . The mild fc- 
dependence of ( f52l can also be obtained using transport 
techniques II51[1 . 

Three-point function. — Here, more possibilities exist. It is 
known that the 0(5^ 2 ) terms in J45l dominate whenever 
the bispectrum is large enough to be observed [|30l. I52f1 . 
Eq. J451 shows that these contributions add incoherently 
to the contribution from st^, so they can be studied sepa- 
rately. Using ( f5T1) and overall symmetry of the correlation 
function under exchange of indices, we can write 



2(27i) 3 5(k a +k /3 +k r )f^+ C 



ay 



k 3 k 3 



3*? 



+ 



a 



r \ap 



k 3 k 3 

a P. 



(53) 
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where the notation "2" indicates that the three-point con- 
tribution contains this contribution among others. The am- 
plitudes a a \p r are symmetric under exchange of /3 and j, 
but not otherwise. Using Eqs. ( !32aD , (132bD , ( 1351) and (l5TT ). 
we find the transport equation 



mula J45l , neglecting the initial contribution -fi^y*/. 



Connections between the transport and other 
approaches 



da 



a\p Y 



dJV 



u aX a X\p r + U §X a a\ Xy + u r X a a\pX j-r^ 



+ u aAp^A/3^(ir" 



If desired, we can apply the same method of formal solution 
described in 3IVAI This yields 



^a\py * amn^ j3j*- jk^'mj'-'nk' 



(55) 



In combination with (1531) this reproduces our earlier for- 



Up to this point we have shown that the Jacobi fields 
which connect "separate universe" trajectories in phase 
space can be used to solve the transport equations for the 
full set of k-space correlation functions. But as we have 
explained, the transport hierarchy is just one of many tech- 
niques for handling correlation functions. We now pause to 
examine the connections between these approaches. 

5N formalism. — In the Lyth-Rodriguez approach, or "5N 
formalism", one makes a Taylor expansion of the field val- 
ues on a final hypersurface in terms of field values on some 
initial hypersurface. Following the discussion surrounding 
Eq. (l30l . and with the same meaning for the vectors x and 
r, this can be written 



5<t> a (r) = r a! (x)50,(r) + -r aiJ (x) {S^MS^r) - {5^),(r)5^{r))} + . . . . 



(56) 



Note that, despite appearances, we are making no assump- 
tion that the evolution of 5cj) is close to an attractor. There- 
fore there is no requirement to invoke the slow-roll ap- 
proximation. It is true that the existence of an attractor 
would make the canonical momenta purely a function of 
the fields, yielding an equation with the appearance of 
Eq. (l56l . But as we have explained, by working in a first- 
order Hamiltonian formalism we can obtain expressions 
such as ( f56l) without this limitation. Therefore we allow 
the 5(p i to include perturbations of the canonical momenta 
if necessary, in which case the indices a, i, etc. range over 
the 2M dimensions of phase space. Where slow-roll is a 
good approximation we can revert to a simpler formulation 
based on field space. 

We have already remarked that the T -tensors are the 
derivatives ([16]) and ([48]). In Eq. ([56] ) the 5<$> a are all de- 
fined on spatially flat hypersurfaces. More commonly, an 
analogous expansion is made for the total e-folding num- 
ber JV, measured from a flat slice to a final comoving slice; 
we give an explicit relation in SjV] The choice of slicing sim- 
ply corresponds to the gauge in which we wish to work [ 8] . 

For ([561 to be useful, some means must be found to com- 
pute T a[ andr ay . 

Flow equations. — As a by-product of the raytracing so- 
lution, or "line of sight" integral, we obtained the evolu- 
tion equations Q37aD and (l50l . These allow the T -tensors 
to be computed easily. However, the same equations can 
be obtained directly from the separate universe formula, 
Eq. ([56]). Substituting ([561 into both the right- and left- 
hand sides of ( 1301) and separating the resulting expansion 
order-by-order, we immediately arrive at Eqs. ( |37aD and 
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( p50l) . This still does not require the slow-roll approxima- 
tion. 

Transfer matrices. — We have observed that Eq. (l30l arises 
in the k/aH — > limit of cosmological perturbation theory 
("CPT"). Within that framework, at least in the first-order 
theory, it is common to introduce "transfer matrices" which 
relate field perturbations at different times [9]- Typically 
these are chosen to be the adiabatic and isocurvature di- 
rections, but in principle any basis can be used. 

Restricting to first-order, the transfer matrix is deter- 
mined precisely by the leading term of ([56] ). or a gauge 
transformation of it. It follows that Eq. (l56l represents the 
extension of the transfer matrix to second-order (and be- 
yond), and Eqs. J37aD and (l50l give the evolution of the 
transfer tensors T ai , T ai j. Therefore the transfer-matrix for- 
malism is precisely equivalent to the separate universe pic- 
ture and traditional cosmological perturbation theory. Note 
that if the perturbations are projected onto adiabatic and 
isocurvature modes this requires use of the correct u ten- 
sors at each time step. 

CPT implies transport equations. — Finally, we show that 
cosmological perturbation theory implies the transport hi- 
erarchy with which we began. We write 



i^PJ^ij' 



(57) 



which, neglecting "loops," follows from ([561 and therefore 
either CPT or a transfer-matrix approach. Differentiating 
both sides with respect to time, recalling that 5^ is time- 
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independent, and make use of j37aj we find 
dE„ 



diV 



[y-aiJ.^iJ.i^Pj + u l3ii^ai^iij) S?iy ( 58 ) 



This gives the transport equation for E ap ., Eq. ( f34l ). A 
similar procedure leads to the transport equation for a ap < r , 
Eq. ( f34l ) . It follows that each of these approaches implies 
and is implied by the others. 



V GAUGE TRANSFORMATIONS 

To this point, the formalism we have developed enables 
the correlation functions of fluctuations in the fields and 
their momenta, 5(f> a and 5p a , to be evolved along the bun- 
dle of trajectories picked out by an ensemble of smoothed 
regions. However, by themselves these fluctuations are not 
observable. Only specific combinations are observable, of 
which the most important is the primordial curvature fluc- 
tuation £. Therefore to proceed we require expressions for 
the gauge transformation between the 5$ a , 5p a and (. 

In this section we impose the slow-roll approximation 
throughout, enabling us to work on field space and make 
use of the hypersurface-orthogonal property of the flow We 
intend to return to the general case in a future publication. 



A. Explicit transformations 

In the slow-roll approximation there is no need to track 
the momentum fluctuations 5p a , which are purely deter- 
mined by the field fluctuations <50 a . Therefore ( can be 
written purely in terms of the field fluctuations. 

On superhorizon scales, the appropriate gauge transfor- 
mation can be written as a Taylor expansion, 



C = N a 5<j) a + -N ap (5<l> a 5<l>p - {5<f> a 5<l>p)) + • 



(59) 



where all fields are evaluated at the same spatial position 
and a constant has been subtracted to set (f) = 0. The 
Taylor coefficients N a and N ap < have been given by various 
authors [l8L l53fl . Working in field space, we give a purely ge- 
ometrical derivation. This argument relies on the property 
that the flow is orthogonal to surfaces of constant density 
in field space, and therefore will not generalize directly to 
the full phase space. 

Linear term. — Consider Fig. [5a] We wish to compute the 
coefficient JV a at a field-space point x, which can be taken 
to lie on a hypersurface of fixed energy density p . We de- 
note this hypersurface E p . According to the separate uni- 
verse approximation, N a can be computed from the num- 
ber of e-folds required to flow back to E p after making a 
generic ("off-shell") displacement from x. Anticipating the 
discussion of second-order contributions, we denote this 
displacement 5(f) 1 and write z = x + 5(f) 1 . 

The number of e-folds required to return to E p must 
be computed along the inflationary trajectory which passes 



through z. In Fig. [5a] this trajectory intersects E p at y. The 
tangent to the trajectory at y is the normal vector n(y). 
Therefore the ("on-shell") field-space displacement along 
this trajectory, to first order in 5(f) 1 , is 50^ ow » —n a np5(j)l. 
The symbol '»' denotes equality up to higher-order terms 
in Scj) 1 which have been omitted, and we have adopted a 
convention in which quantities evaluated at x — such as the 
unit vector h — are written without an argument. Combin- 
ing Eqs. ([3]) and ([5]), we conclude 



5N 



1 K54>1 
M P J2e 



and therefore 



dN 



1 n„ 



(60) 



(61) 



30i Mpv^ M P v 

where we have reintroduced the refractive index v = V2e 
defined in 911 Al Eq. ([61]) is the term N a in ([59]). 

Quadratic term. — The quadratic Taylor coefficient can be 
obtained from the variation in dN /d<fr l a under a second 
generic displacement 5cj) 2 . Under this displacement the 
origin is shifted to x' = x + 5cf) 2 . Because the energy 
density at x' will typically differ from p, it lies on a dis- 
placed hypersurface E / . However, the definition of N is 
unchanged and must still be measured to the intersection 
with E p at y. We should compute the flow along the trajec- 
tory passing through z. The path z — > y' — » y is a discrete 
approximation to an integral along this flow. The calcu- 
lation should be carried to linear order in 5(f) 1 and 5<fr 2 
independently. 

In Fig. I5b[ the on-shell flow from z = x' + 5(f) 1 back to 
E p / is 5(p a . Repeating the analysis above, we find 



5^^-KA'pS^, 



(62) 



where n' a = n a (x'~) & n a + 5(pldpn a . (It is only necessary 

to work to first order in 5<fr 2 , since J62l is proportional to 
50 1 .) The on-shell flow from y' back to E p is 

We have defined A a to be the displacement to y' , 

A a = 54>l + 54> 2 a + 5r a , (64) 

and v!' a =n a (y'). The symmetric tensor K a p is the extrinsic 
curvature of E , or "second fundamental form," and is de- 
fined by K a p = h ay hp 5 d y h 5 [123 1 . It is related to the dilation 
and shear of the expansion tensor via 



K„ 



i re 



M P v V d 



(65) 



where cr 1 ^ is the projection of the shear onto the isocur 
vature subspace, cr^S = h ar hp 5 a rS . The first term in 
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54> R 



, 54,' 




(a) First order 



(b) Second order 



FIG. 5: Gauge transformations in field space 



is a linear, trigonometric approximation. The second is a 
correction for the curvature of S p . A similar construction 



could be used to obtain the Taylor coefficients at any de- 
sired order. 
After computing all appropriate variations, we find 



d 2 N 



= ~Ti—[K a p-n a &plnv-np& a \nv -———), 

M D v V ' ' ' M D 2v J 



(66) 



where 17 = dlne/dJV is the natural generalization of the 
single-field 17-parameter. It measures the variation of e 
along the adiabatic direction. To yield sufficient e-foldings, 
it must typically be small while observable scales are leav- 
ing the horizon. Defining S> a = h a d a to be a derivative 
along h a , it can be written 



2M P 
17 = @ CT e. 



(67) 



In addition, @ a = h a pd^ is a derivative in the plane tan- 
gent to S p at x. This tangent space can be interpreted as 
the subspace of isocurvature modes. Only the 17-component 
of (l66l ) depends purely on the local behaviour of the adi- 
abatic direction, and therefore the direction in field space 
restricted by the slow-roll approximation. The remaining 
terms all probe details of the isocurvature subspace. 

Dropping the distinction between Sep 1 and 5(f> 2 , Eq. (1661 ) 
is equal to JV ap . . It is symmetric even though we have not 
treated the displacements 5(f) 1 and Sep 2 equally. This is 
a consequence of associativity of vector addition, which 
makes z the same no matter in which order we apply the 
displacements. The inflationary trajectory passing through 
z is unique, so JV ap - can only depend on a symmetric com- 
bination of Sep 1 and 5<fr 2 . 



Eq. (1661 ) shows that N a p depends on the anisotropy of 
e — or, in the optical interpretation, the refractive index v. 
It also depends on the extrinsic curvature of S p , which is a 
function of the shape of the hypersurfaces of constant en- 
ergy density In particular, because n a K a a = 0, this term 
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can be interpreted as a metric on the subspace of isocurva- 
ture modes. 



B. Local mode / NL 

Two-point function. — These results can be combined to ob- 
tain the usual formulae for the amplitude of the local mode, 
/ NL . With our usual assumptions about the amplitude of 
those correlation functions we neglect, the two-point func- 
tion of £ satisfies 



(C(kiX(k 2 )) = (27i) 3 5(k 1 + k 2 )iV a iv p T ai r p 3 



+ 






(68) 



where k is the common amplitude of k x and k 2 and N a 
is the first-order component of the gauge transformation, 
Eq. ( f6lT) . Application of the chain rule to the contractions 
in J68l allows the Lyth-Rodriguez Taylor coefficients to be 
identified, 



N,= 



dN 



■N a r ai . 



(69) 



It follows that ([68J> is the standard result [5TJ. 

Three-point function. — Neglecting the initial three-point 
function si?^, the bispectrum can be computed by sim- 
ilar methods. There is an added complication from second- 



17 



order terms in the gauge transformation ( f59l ). Working from ([45]) (or (1531) and ([55])) gives 



J 



(C(k 1 )C(k 2 )C(k 3 )) = (27i) 3 5(k 1 +k 2 +k 3 )(jv a r amn +iv a/3 r am r /3 jjv 1 jv ; x,,^ 

We can make the identification 



fc?fc3 



+ 



PP 



+ 



PP 



H 6 
+ 0| — T 



'■ N a^aij +N a pr ai rpj, 



(70) 



(71) 



where N a p is the second-order term 
immediately, 



3 2 N 

N — 

. The familiar approximation for the amplitude of the local mode, f NL , follows 



:/i 



NL 



N mn N } N k ^ mj ^ nk 
{N a NM 2 



"a*- nmn'" j ^ k^mj ^nk 

(N„NM 2 



+ ■ 



N a p r am FpnNjN k Sf m j Sf nk 



— fV, rgauge 
— ->NL " r ->N 



'nl 



(JV q JV,.^ qr ) 2 



(72) 



In the final step we have divided the contributions into an 
intrinsic term, f^ L (which contains T amn ), and a gauge con- 
tribution f^ nse (which does not). The intrinsic term de- 
pends on the bispectrum of the fluctuations 5<p a . Eq. J46l ) 
shows that it depends on u a ^ r , and therefore has a mem- 
ory of the nonlinear evolution of the connecting vectors 
along the trajectory. However, it has no dependence on 
the nonlinear part of the gauge transformation. Vice versa, 
the gauge term depends on the nonlinear part of the gauge 
transformation, and only on the linear evolution of the con- 
necting vectors — that is, the Jacobi fields, in the guise of 



the van Vleck matrix (fl6l ). 

This separation was first made in Ref. |@], where it was 
shown that the gauge contribution dominated in a class of 
models known to generate large |/ NL | 11541 . We will sharpen 
this division slightly in Eqs. d84aD - j84bl ) below. 

The outcome of this discussion is that / NL could be com- 
puted efficiently by decomposing ( [721 ) into the component 
gauge transformations and T-symbols, which can be ob- 
tained using ordinary differential equations. An alternative 
approach is to work from the explicit formula ( f53l) . yielding 



(C(k 1 )C(k 2 )C(k 3 )) = (27i) 3 5(k 1 +k 2 + k 3 )(JV a JV /3 JV r a a | /3r + N ap N r N s ll ar llp S ) 



klkl 



+ 



fc?fc| 



+ 



PP 
K 2 K 3 



■S 1 



It then follows that 

5 N a N p N r a alPr 



■'N 



5 N a N p N r a al3r 



6 (JV^E^) 2 18 (JV A JV M E a/3 ) 2 



(74) 



In the final equality we have defined a a p Y by symmetriza- 
tion, 



l aj3y 



l a\Py + a p\ay + a y|ct/3- 



(75) 



Note that this combination is not normalized to give weight 
unity. Eq. ( [541 shows that it obeys the transport equa- 
tion ( [351 for the three-point function after dropping primes 
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on all indices. It was in this form that / NL was quoted in 
Refs. |@], although the derivation was given in real space 
and is not the same as the one given here. 

Gauge contribution. — There is some interest in isolating 
the gauge contribution to |/ NL |. As explained above, this 
is known to dominate in some models, including examples 
where large |/ NL | is generated during a turn in field space 
II18L [541 . Comparison with J73T ) shows that it can be writ- 
ten 



_ rgauge 
c/NL 



N ap N r N 5 T, ar T, (i 



Combining (16TT ) and 



(JV A JV M £ A/ J 2 

gives an explicit expression, 



(76) 



rgauge 
'NL 



1 
2 



■M P V 



(aScpjK^id^fsa) (a5(j) a ) © a lnv 



{crcrf 



{era 



(77) 



where we have defined (era) = ft a n«E a n, and (cr5<fi 



hpH a iJ. This expression is covariant under rotations of the 
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isocurvature plane. However, its form suggests a natural 
coordinate basis in which its content is more transparent. 
Since K a p is symmetric, it can be diagonalized. Its eigen- 
vectors form an orthonormal basis directed along the prin- 
cipal curvature directions of the fixed energy-density hyper- 
surface in field space. We label these eigenvectors with an 
index m and denote them £™, which can be considered as 
a vielbein. We can refer to the corresponding isocurvature 
directions as the principal isocurvature modes. The corre- 
sponding eigenvalues of the second fundamental form are 
the principal curvatures k m . 

Next, we define a correlation coefficient p m between the 
m th principal isocurvature mode and the adiabatic direc- 
tion, 



Q{a54> a ) = {am)=p n 



(ctct) 112 (mm) 1/2 . 



(78) 



It is also useful to define analogues of the 17-parameter for 
the isocurvature directions. It is a matter of convention 
how this is done. By analogy with our definition of 17 in the 
adiabatic direction we set 



2M D 



In 



-e®a< 



(79) 



Unlike the adiabatic 17-parameter, these isocurvature y] m - 
parameters need not be small even if slow-roll is an excel- 
lent approximation. In this basis we find 



.gauge ^y 
-'NL o ~ / 1 '/r 



mm 



1/2 



(era) 



1/2 



-M P v^fc m p^ 



(mm) 



(era) 
(80) 
As has been explained, these depend only on the Jacobi 
fields and geometrical quantities at the time of evaluation 
for/ NL . We have not displayed the k-modes associated with 
these objects. Eq. (l80l strictly applies for roughly compa- 
rable |k|. 

There are three contributions. First, there is the adia- 
batic 17 -parameter. As explained above this will almost al- 
ways be negligible. Second, there is a weighted sum of 
7) m -parameters associated with the isocurvature directions. 
These may be individually large. Their contribution is sup- 
pressed by the correlation coefficient p m between the adi- 
abatic mode and fluctuations in the m th direction, and also 
by the "anisotropy factor" ((mm)/(o-cr)) 1/2 which mea- 
sures their relative amplitude. Third, there is a weighted 
sum of the principal curvatures. These are weighted by the 
combination p 2 i (mm)/(crcT). Therefore, this term is typi- 
cally dominant when the bundle has exaggerated extent in 
at least one isocurvature direction. 

In a two-field model, Eq. (l80l becomes especially sim- 
ple. There is only one principal isocurvature mode, and it 
is orthogonal to the adiabatic direction. Also, the second 
fundamental form K a p has a null eigenvector and therefore 
the principal curvature k is simply its trace. Comparison 
with d65l) shows that 



k = trlC 



M P v 



M-l 

1 

M 



-n„n«o-„ 



(81) 



As in SOU we have set M to be the dimension of field space. 

Non-Gaussianity at the adiabatic limit. — There has been 
considerable interest in the fate of non-Gaussianity if an 
adiabatic limit is reached during slow-roll inflation. Mey- 
ers & Sivanandam [13711 studied a class of models in which 
/ NL , g NL and t nl decay to negligible values when all isocur- 
vature modes decay, and argued that this behaviour is 
generic. However, explicit examples exist in which an ob- 
servable value of / NL persists even after all isocurvature 
modes are extinguished [Tisl. 1391. [55f1 . The separation of 
/ NL into intrinsic and gauge contributions allows us to shed 
further light on this issue. 

At an adiabatic limit we expect (mm) — » 0, and therefore 
Eq. (|80T > implies f^ uge w i?/2. The same conclusion can 
be obtained from ( f77l ) because any tensor projected onto 
the isocurvature plane (such as K a p or @ a ) is orthogonal 
to E a/3 in this limit. This is an advantage of the tensorial 
approach we have described, based on associating isocur- 
vature modes with the tangent plane to surfaces of constant 
energy density in phase space. 

One can also show that the intrinsic / NL satisfies 



/, 



NL 



: /nl + 2 



1 

2' 



(82) 



where AL' denotes evaluation just after the adiabatic limit 
is reached. In the language of 311 El this may coincide with 
the onset of an inflow trajectory. We conclude that, at any 
subsequent time, / NL has value 



f _ r<£.AL , ^AL 



(83) 



which is constant as we expect. If the adiabatic limit 
is reached during slow-roll inflation, where 17^ must be 
small, this enables us to give a more precise formulation of 
Meyers & Sivanandam's argument: if / NL is large in the adi- 
abatic limit, it must be because a large intrinsic three-point 
function is developed during the evolution. This is indeed 
the case in known examples where a large fm is reached in 
the "horizon-crossing approximation" lll8Ll39l . 

Eqs. d80]), dSU and ([53]) also enable us to sharpen the 
division between "gauge" and "intrinsic" contributions. We 
define 
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(84a) 
(84b) 



The advantage of this redefinition is that the A- and B- 
type contributions are constant at an adiabatic limit; in- 
deed, fJl is zero there because it captures only transient 
effects caused by the evolving isocurvature modes. How- 
ever, when |/ NL | is large the A- and B-type terms approxi- 
mately correspond to the intrinsic and gauge / NL . 

This division is not unique, because a total derivative can 
always be added to the time integral in r a y. However, 
the division in Eqs. J84aD -( [84bl ) seems phenomenologically 
useful because all models (of which we are aware) which 
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generate large non-gaussianity do so in one of two ways: 
either f£ L becomes large at the adiabatic limit, or f£ is 
large some time before the adiabatic limit is reached. As 
the following examples show, the underlying reason seems 
to be that the B-type term responds immediately to strong 
distortions of the shape of bundle, whereas the A-type term 
does not. 

Example: Byrnes etal. model. — We illustrate Eqs. ( [771 ), (f80l ) 
and d84aD -( [84b1 ) using examples drawn from the literature. 

Consider the model V = V (p 2 e~ Xx introduced by 



Byrnes et al. H541 . We follow their choices, setting A = 
0.05MjT 2 and fixing initial conditions <p = 16M P and % = 
O.OOlMp. The first phase of evolution is descent from a 
ridge, during which a large spike in / NL is generated by 
the gauge term. An interpretation of this contribution was 
given in Ref. iflal . 

In Fig. [6a] we plot / NL during the inflationary phase. For 
most of the evolution it is dominated by f^ uge - In turn 
/j^ uge is dominated by the extrinsic curvature term K a ^ . In 
Fig. [6b] we plot the difference between the full / NL and the 
K a p-term, demonstrating explicitly that it is small. 

In Figs. [6cH6fl we plot the bundle parameters which de- 
termine the K a p-term and the other contributions to f^ uge - 
The correlation constant is initially zero but approaches 
— 1, making the curvature and isocurvature mode (anti-) 
correlated, as first discussed by Langlois 0221 1 . The princi- 
pal curvature k and isocurvature 17 -parameter exhibit only 
modest evolution over the entire range of e-folds. In com- 
parison, the anisotropy factor ((mm^crcr)) 1 / 2 grows dra- 
matically. Its evolution is the dominant factor which deter- 
mines the evolution of / NL . A large / NL arises because the 
ensemble of separate universes becomes highly anisotropic, 
with nearly twenty-five times as much power in the isocur- 
vature direction as in the adiabatic direction. Evidently this 
must arise from a large contribution to the integrated shear 
in the propagator matrix. 

Note that, although /^ L ^ f^ uge responds immediately 
to this strong anisotropy factor, there is no corresponding 
significant enhancement of the intrinsic three-point func- 
tion. 

In Fig.[6g]and[6hlwe plot the bundle dilation, 6, and the 
focusing 0. The dilation is always positive, so the bundle 
cross-section grows monotonically. Hence the total power 
in the isocurvature mode also grows monotonically. Evi- 
dently, the spike in / NL is not due to the total isocurvature 
power, but to its relative growth compared with the adia- 
batic power. The large 8 implies that this model does not 
reach an adiabatic limit, and some other mechanism must 
be invoked to end inflation and determine the value of each 
observable. In Ref. 11541 it was assumed that sudden desta- 
bilization of a waterfall field could play this role. 

Example: axion quadratic model. — A similar phenomenon 
occurs in the axion-quadratic model discussed above. We 
plot the evolution of / NL in Fig. [7a] It exhibits three dis- 
tinct components. The first is a negative spike, generated 
by the axion rolling off its hilltop. The second is a smaller 
positive spike produced by the axion rolling into its min- 



imum. These two spikes come from the gauge contribu- 
tion to / NL , as clearly shown in Fig. [7b] Fig. [7c] shows that 
each spike is inherited from a spike in the anisotropy factor. 
This is consistent with the analysis of Elliston et al. iflal . 
in which the spikes were interpreted as due to strong de- 
formations in the shape of the bundle. In the present in- 
terpretation, the differing signs arise because the principal 
curvature changes sign in the intermediate evolution. 
As for the Byrnes et al. model, the intrinsic term /^ L ^ 

f^ L does not respond immediately to this strong anisotropy, 
growing only later on approach to the adiabatic limit. The 
anisotropy is due to a strong shearing effect arising near 
the turn from dominantly <£ -evolution to dominantly 7- 
evolution. Near the deep negative spike in / NL , there is 
an enhancement in the shear oriented parallel to the prin- 
cipal isocurvature mode. This enhances the fluctuations in 
the isocurvature direction. 

The third feature is the flat plateau at late times, associ- 
ated with the adiabatic limit. Fig. [7b] shows that this comes 
from grow th in the intrinsic term f^ L ; see the discussion in 
Refs. BUM]. 



VI. SUMMARY 

In this paper we have developed an analogy between in- 
flationary perturbation theory and geometrical optics. Here 
we summarize the main steps in the discussion. 

Background. — In inflationary perturbation theory, we are 
interested in following the statistical properties — as mea- 
sured by the correlation functions — of an ensemble of 
spacetime regions. This ensemble can be constructed 
equally well within the separate universe picture or tradi- 
tional cosmological perturbation theory 

The ensemble picks out a cloud of points in phase space. 
In the limit k/aH <C 0, interactions between members of 
the ensemble are suppressed and each point moves along a 
phase space orbit of the unperturbed system. Therefore the 
ensemble traces out a narrowly-collimated "spray" or bun- 
dle of trajectories. Where slow-roll applies, the momenta 
are determined in terms of the fields and we can work in 
terms of a simplified flow on field space. 

Optical quantities. — Geometrical properties of the bundle 
of trajectories can be used to describe its evolution and de- 
termine its statistical properties. The quantities of princi- 
pal importance are obtained by decomposing the expan- 
sion tensor, yielding the dilation, shear and twist. These 
are well-known from the description of light rays in gen- 
eral relativity. 

Jacobi fields and van Vleck matrix. — The dilation, shear 
and twist determine the evolution of Jacobi fields, which de- 
scribe infinitesimal vectors connecting nearby trajectories. 
At any point in the flow, the van Vleck matrix aggregates 
the linearly independent Jacobi fields. The Jacobi fields 
themselves are measured from a fiducial trajectory, which 
can be thought of as the eikonal of geometrical optics. This 
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FIG. 6: Bundle parameters for the Byrnes et al. model V = V 4> 2 e Xx . The initial conditions are 4> = 16M P and % = 0.001M P , and A = 0.05M P 2 . All plots are against the 
e-folding number N, measured from horizon exit of the mode in question. 



analogy is exact within the slow-roll approximation. 

We have argued that different implementations of the 
separate universe assumption — such as the Lyth-Rodriguez 
Taylor expansion, or the transport equations of floU — can 
be thought of as different methods to compute the Jacobi 
fields, in the form of the van Vleck matrix (IT6t . More gen- 
erally, the same is true for all approaches to perturbation 
theory in the limit k/aH — > 0. The most familiar imple- 
mentation of the separate universe assumption, the "5N 
formalism" or Taylor expansion approach, follows from Ja- 



cobi's method of varying a solution with respect to its con- 
stants of integration. Conversely, the transport equations 
arise more naturally from Jacobi's differential equation. 

On approach to a caustic, some number of Jacobi fields 
decay. At an adiabatic caustic, defined in 911 Dl all but one 
of the Jacobi fields decay. The single remaining field repre- 
sents fluctuations along the caustic. In inflation this mode 
is the adiabatic fluctuation. The other Jacobi fields rep- 
resent isocurvature fluctuations between the spacetime re- 
gions which make up the ensemble. Therefore, focusing at 
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FIG. 7: Bundle parameters for the axion-quadratic model; for the potential and initial conditions, see Fig.[^ All plots are against the e-folding number N, measured from 
horizon exit of the mode in question. 



an adiabatic caustic can be interpreted as decay of isocur- 
vature modes, or app roach to an adiabatic limit in the sense 
of Elliston et al. ftlW . 

Transport equations. — The "u-tensors" encode evolution 
of the connecting vector fields. We have argued that these 
tensors can be computed using either cosmological per- 
turbation theory or the separate universe approximation. 
More generally, any formalism which can reproduce the k- 
space deviation equation (f3TT) will reproduce the correct 
correlation functions, because the u-tensors uniquely de- 
termine the transport equations. Therefore the u-tensors 
may be used as an objective way to compare competing 
formalisms. 

The transport equations obtained in this way are general- 
izations of the transport equations previously introduced in 
Ref. [Jgl] . Because they are expressed in terms of u-tensors, 
it follows that they can be integrated in terms of the Jacobi 
fields and their derivatives. Therefore the correlation func- 
tions can be expressed using the van Vleck matrix and its 
derivatives. (Technically it is the inverse of the van Vleck 
matrix which appears, in the form of the propagator ma- 
trix il5\i.) 

In turn the van Vleck matrix can be expressed in terms 
of the integrated dilation, shear and twist. This makes it 
possible to diagnose regions where the flow may become 
adiabatic by tracking the behaviour of the focusing param- 
eter 0, defined in Eq. ( PT91 ), and the behaviour of the shear 
and twist. 

Working within the slow-roll approximation we have ar- 
gued that > 1 implies the presence of remaining isocur- 
vature modes. To be compatible with experiment, these 
must almost certainly decay before the surface of last scat- 
tering. The consequent transfer of power into the adiabatic 
mode can change the value of £. 

Flow equations. — The Jacobi fields yield a formal solu- 
tion for each correlation function, analogous to the "line of 
sight" used to simplify integration of the Boltzmann equa- 
tion in CMB codes. 

This formal solution demonstrates explicitly that the 
transport equations reproduce the Taylor expansion algo- 
rithm of Lyth & Rodriguez. In doing so we also obtain ex- 
plicit expressions for the Taylor coefficients F ai and T ai j in 



terms of integrals of the expansion tensor and its deriva- 
tives along the flow. Similar expressions had previously 
been obtained by Yokoyama et al. 10] ■ 

These explicit expressions can be manipulated to obtain 
a closed set of evolution equations for the Taylor coeffi- 
cients. These are Eqs. ( |37aD and J50l ). Such equations are 
extremely helpful in practice, because it means the Taylor 
coefficients can be obtained without the challenging prob- 
lem of extracting a variational derivative after numerical 
integration: without a sufficiently accurate integration al- 
gorithm, the small variation of interest can be swamped by 
numerical noise. 

Transport of shape coefficients. — Even after obtaining 
the Taylor coefficients, it is necessary to extract coefficients 
for each type of momentum dependence (or "shape," in in- 
flationary terminology) which occurs in a correlation func- 
tion. An alternative is to return to the full k-space trans- 
port equations and derive evolution equations for these 
coefficients directly. The first nontrivial case is the three- 
point function, whose shape coefficients are determined by 
Eq. ( f54l) . 

Gauge transformations. — Specializing to the slow-roll ap- 
proximation, where the flow can be described in field 
space, ray-tracing techniques can be used to obtain the 
gauge transformation to £. In this way the gauge trans- 
formation is expressed using geometrical quantities in field 
space, rather than merely derivatives of the potential. 

In models where a large / NL is obtained from the gauge 
transformation, this gives a geometrical interpretation of 
its magnitude. The contributory factors are: (1) the 
17-parameters of the adiabatic and principal isocurvature 
modes; (2) the principal curvatures of uniform-density hy- 
persurfaces in field space; (3) the correlation coefficient 
between the adiabatic fluctuations and the fluctuations in 
each principal isocurvature mode; and (4) an anisotropy 
factor which measures distortions in the cloud of field- 
space points representing the ensemble. 

In two cases where a large, transient contribution to / NL 
has been observed, we show this principally arises from a 
strong enhancement in the anisotropy factor. 

Comparison with other geometrical formulations. — In 

common with all other approaches to the evolution of cor- 
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relation functions, the interpretation described in S flTTfl JVl 
is a reformulation of perturbation theory. All approaches 
carry the same physical content. Therefore, aside from 
practical considerations, the merit of each reformulation 
arises from the insight gained by emphasis on different 
structures. 

The formulation we have given emphasizes the back- 
ground phase space manifold, which encodes the structure 
of the theory in its geometry. This geometrical structure 
is mapped out by the behaviour of the trajectories flowing 
over it. Globally, this connection is made precise by the 
methods of Morse theory. Locally, it is encoded in the Ja- 
cobi fields whose role we have highlighted. 

Attempts to reformulate perturbation theory in terms of 
geometrical objects have already attracted attention by var- 
ious authors. Gordon et al. [|56f1 and Nibbelink & van 
Tent iflOfl formulated perturbation theory for the two-point 
function in terms of the Frenet basis, which they called 
the "kinematical basis." (See also Achurcarro et al. [13].) 
Peterson & Tegmark later extended this approach to the 
three-point function II12[1 . A Frenet basis can be defined 
for each trajectory, and the Frenet-Serret equation de- 
scribes how this basis is transported along the trajectory. In 
Refs. [floL [l2l. [la. |56[1 these equations are used to describe 
transfer between the adiabatic and isocurvature modes. 

In the Frenet formulation, the isocurvature modes are 
identified with the normal, binormal, . . . , vectors. In our 
formulation these modes arise from the eigenvectors of the 
extrinsic curvature, K a p, which we have described as the 
principal isocurvature modes. The tangent plane spanned 
by the Frenet normal, binormal, . . . , is the same as the sub- 
space spanned by the eigenvalues of K a p, so the physical 
content of these formulations is the same. More generally, 
in our formulation the properties of the isocurvature modes 
are expressed using the familiar mathematical apparatus 
used to describe hypersurfaces — normal vectors, first and 
second fundamental forms, and so on. 

In addition, we explicitly separate a "local" contribution 
to each £ correlation function, arising from a gauge trans- 
formation and depending on the precise orientation of the 
Frenet basis, from the "integrated" contributions, obtained 
by solving the transport equations. Although it is clear that 
one can equally well express the integrated contributions 
in any suitable basis, it requires extra effort to rotate to the 
Frenet basis at each step in the integration. We feel it is 
preferable to express the evolution equations of perturba- 
tion theory in terms of the original basis on field space. 

Future directions. — This formalism can be extended in 
several directions. 

First, at some points in the discussion we specialized to 
the slow-roll approximation, to take advantage of certain 
simplifications — such as the twist-free and hypersurface- 
orthogonal character of flow. However, as we have pre- 
sented it, the underlying formalism is independent of slow- 
roll. It can be used to evolve both field and momentum 
perturbations. This is desirable because future data from 
microwave background or galaxy surveys will be highly ac- 
curate, demanding commensurate accuracy in our theoret- 



ical calculations. 

Second, in this paper we have interpreted the decay of 
isocurvature modes, and approach to an adiabatic limit, as 
focusing of the bundle to an "adiabatic" caustic. Our de- 
tailed discussion was restricted to field space. It should 
also be possible to study focusing and decay of isocurva- 
ture modes on the full phase space, providing a framework 
for the study of kinetically dominated scenarios, such as 
descent through the waterfall of hybrid inflation, where fo- 
cusing may also occur. 

Third, the existence of explicit expressions for the Taylor 
coefficients T ai and r ai; may enable new analytic solutions 
to be found. 

Finally, the entire formalism can be extended to higher 
n-point functions. The case of principal interest is the four- 
point function. In contrast to the three-point function, this 
requires two shape parameters which determine t nl and 

&NL- 
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Appendix A: Yokoyama et al. backwards formalism 

In 3IVBI we presented integral formulae for the 
"5N" coefficients, Eqs. J49aD -( [49bl l, and noted that es- 
sentially identical expressions had been presented by 
Yokoyama et al. [0]. (However, Yokoyama et al. obtained 
their results by very different means.) Since their work is 
closely related to our own in content and outlook, we take 
this opportunity to review and extend their results. 

Their aim is to develop evolution or "transport" equa- 
tions (in our terminology) for objects closely related to ob- 
servation, such as the derivatives JV, — defined in section 
SjV]— and the / NL parameter. They proceed as we do, first 
fixing a flat initial hypersurface. In our notation this is dis- 
tinguished with lower case Roman indices. Unlike us, they 
also fix the final slice to be the precise time at which we 
wish to know the value of each observable quantity. In our 
notation this slice is labelled with Greek letters, and we 
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obtain the matrices r ai and r„y as a function of it. Ob- 
servables can be obtained after evaluating these functions 
at the time of interest. Instead, Yokoyama et al. consider 
intermediate flat slices between the initial and final slices, 
and express their answers as a function of the intermediate 
time. As we now explain, observables are to be obtained by 
setting this intermediate slice equal to the initial hypersur- 
face. 

For convenience we extend our index notation, and label 
quantities evaluated on the intermediate slice with upper 
case Roman indices. Yokoyama et al. introduce the quan- 
tity 



N:=N a r a 



(Al) 



where N a was defined in ^V) N, is the derivative of the num- 
ber of e-folds between an intermediate flat hypersurface 
and the final uniform density hypersurface, with respect to 
the field values on the intermediate slicing. Yokoyama et al. 
introduce a further quantity © 7 = r 7l iV;. Note that 6 7 is 
not to be confused with the focusing parameter 6 defined 
in the text, which is the exponential of the integrated dila- 
tion. Nj and 9 7 obey the autonomous transport equations 



dA/j 

diV 
d9 7 

dJv" 



-UuNj, 



(A2) 
(A3) 



Evaluating JV 7 at the final hypersurface gives N a , which pro- 
vides a boundary condition for the differential equation. 
One can then evolve backwards in time until we reach the 
initial slice. At this point N, will equal N h which is the Tay- 
lor coefficient we set out to calculate. After this has been 
done, 6 7 can be evolved forwards from the initial hypersur- 
face with boundary condition 0; = N t . 

We describe this as the "backwards" formalism, to be 
contrasted with the "forwards" formalism we have de- 
scribed in the text. 

The introduction of these quantities is ingenious. Em- 
ploying Eq. ( fTTT ) together with Eq. ( 149b D yields 

N [; =JV a r a J r^u^r^jdN'+N^TaiTpj, (A4) 

Jn* 

In turn this leads to 

/•N 

N t Ni,N } =\ iv 7 u„ jc e j e fc djv' + e a e /3 iv a p. (as) 

Jn* 



Therefore, / NL can be evaluated with knowledge only of N,, 
©/ and u UK , 

In performing this calculation, Yokoyama et al. traded a 
three-index object (either r aiJ or a a |« r , depending which 
formulation is in use) for two one-index objects, N, and 
7 . This involves fewer equations and therefore can be 
numerically advantageous. 

Nevertheless, the backwards formalism has some disad- 
vantages. First, because it computes only the Taylor coeffi- 
cients, information about isocurvature modes is discarded. 
The evolution equations for T, a p and a a ^ or r ai - and T aij , 
allow the isocurvature modes to be retained. 

Second, to obtain information about the time-evolution 
of any observable it is necessary to recalculate JV 7 and 7 
with multiple final times. Although the method yields N h 
which is apparently related to the gauge transformation at 
an intermediate time, this is not quite correct. JV 7 is de- 
fined for a fixed future rather than past boundary condition, 
and therefore gives information about a range of scales at 
a fixed time of observation, rather than a fixed scale at 
a range of final times. The past-defined objects required 
for the latter are automatically provided by the forwards 
formalism, meaning that multiple integrations are not re- 
quired. 

If the time of observation is known then the backwards 
formalism gives an efficient means to treat multiple scales 
at once. 

To extend the backwards formalism to the trispectrum, 
one needs to separate the observables t nl and g NL . For 
this purpose A/y and ATyj. themselves would be required. 
Therefore, given the potential utility of this method, we 
conclude by extending it to include a backwards evolution 
equation for N u . As for the spectrum, this can be used to 
obtain information about / NL and t nl over a range of scales 
at a fixed time of observation. It still requires the solution 
for only a two-index object. The transport equation for N u 
can be shown to be 



diV 



JK 



dN 



-Uijk n i ~ UijN ik - u ik Njj . (A6) 



This is to be solved backwards from the final hypersurface 
where N u is equal to N a g . 
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